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Abstract 

A state-dependent relay channel is studied in which strictly causal channel state information is 

available at the relay and no state information is available at the source and destination. The source 

OO ' and the relay are connected via two unidirectional out-of-band orthogonal Unks of finite capacity, and a 

' state-dependent memoryless channel connects the source and the relay, on one side, and the destination, 

O ■ 

, on the other Via the orthogonal links, the source can convey information about the message to be 

C3 ' delivered to the destination to the relay while the relay can forward state information to the source. This 

exchange enables cooperation between the source and the relay on transmission of message and state 

information to the destination. First, two achievable schemes are proposed that exploit both message and 
• 1-^ 

, state cooperation. It is shown that a transmission scheme inspired by noisy network coding performs 

H ■ 

Cd ■ better than a strategy based on block Markov coding and backward decoding. Next, based on the given 

achievable schemes and appropriate upper bounds, capacity results are identified for some special cases. 
Finally, a Gaussian model is studied, along with corresponding numerical results that illuminate the 
relative merits of state and message cooperation. 



The work of M. Li and A. Yener was supported in part by the National Science Foundation under Grants CNS 0721445, 
CNS 0964364, CCF 0964362 and DARPA ITMANET Program under Grant W91 lNF-07-1-0028. The work of O. Simeone was 
supported in part by the National Science Foundation under Grant CCF 0914899. 

This work will be presented in part in Information Theory and Applications Workshop, 2011. 



2 



I. Introduction 

In a wireless network, main challenges to provide reliable communications include fading 
and interference. To establish the fundamental performance limits of such channels, a useful 
model is that the underlying channel is affected at each time instant by a state variable, which is 
controlled by a certain state distribution [|I]-[l3l. State-dependent channels are usually classified 
on the basis of the availability of channel state information at encoders and decoders. Specifically, 
transmitting nodes may have no state information, or else be informed about the state sequence in 
a strictly causal, causal, or non-causal way [|3l-[|21. Causality refers to whether the state sequence 
at a given time is known up to the previous instant (strictly causal state information), up to and 
including the current time instant (causal state information) or past, current and the future (non- 
causal state information). For decoders, it is enough to distinguish between the availability of 
state information or not flS]. 

While fading induced state variations are often measured at the receivers via training signals, 
leading to availability of the states at the destinations, interference induced state variations are 
not so. In particular, when the channel state models interference from other users, the state can 
be more effectively measured at nodes that are in the vicinity of the interferers, while nodes 
further away cannot directly measure the state. In this case, it may happen that nodes that are 
currently serving as transmitters may acquire state information, while the respective receivers 
may not. In this paper, we shall focus on such a scenario and analyze the performance trade-off 
arising from the need to convey both message and state information from transmitters to the 
receivers. 

In previous work, capacity-achieving strategies have been proposed for point-to-point memo- 
ry less channels with non-causal state information [|2l, [|6l and causal state information [HI at the 
encoder and no state information at the decoder. These results, and the ones discussed throughout 
the paper, assume that the state sequence is independently and identically distributed (i.i.d.). 
Several multi-user channels have also been widely investigated in similar settings including 
multiple access channels (MACs) JH-ini and relay channels [[I3l-[[l6l. References llTll-lfTTIl 
consider the MAC with different availability of non-causal or causal state information at some 
encoders. In lfT2l . message and state cooperation is considered for a MAC with conferencing 
links with non-causal state information available at the encoders and the decoder. For the relay 
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channel, reference [[T5l investigates the case of non-causal state information at the relay, and 
proposes a coding scheme that combines the strategies of decode-and-forward flTTl and precoding 
against the state, while reference [[131 studies the case of causal state information at the relay, 
and derives achievable rates by combining the ideas of compress-and-forward [[TTl and adapting 
input codewords to the state (also known as Shannon strategies [1]). 

This work also focuses on a state-dependent relay channel, but unlike previous work, assumes 
that state information is available only at the relay in a strictly causal fashion. This scenario is 
more relevant in practical scenarios since in practice the state can be learned only in a strictly 
causal way. For instance, in the case of an interference network, an interfering sequence can 
be leamed as it is observed, and, thus, in a strictly causal manner. With strictly causal state 
information, the strategies leveraged in [[131 . [[TSl . for example, of precoding against the state 
or Shannon strategies cannot be applied. More fundamentally, the question arises as to whether 
strictly causal, and thus outdated, state information may be useful at all in a memoryless channel 
with i.i.d. state sequence. In fact, it is well known that strictly causal state information is useless 
in point-to-point channels. This conclusion can be seen along the lines of argument for the fact 
that feedback does not increase the capacity for memoryless point-to-point channels in [fTSl . 

Recently, in [31, [[51, it was found that, in contrast to the case for point-to-point channels, 
for two-user MACs with independent or common state information available strictly causally 
at the encoders, capacity gains can be accrued by leveraging this information. Our recent work 
[[T9l further extended such results to MACs with arbitrary number of users by proposing a 
coding scheme inspired by noisy network coding [[20l . In [31, [[51, [fT9l . the main idea is to let 
each transmitter convey a compressed version of the outdated state information to the decoder, 
which in turn exploits this information to perform partially coherent decoding. The results show 
that the capacity region can be enlarged by allocating part of the transmission resources to the 
transmission of the compressed state. 

In this work, we consider a three-node relay channel where the source and relay are connected 
via two out-of-band orthogonal links of finite capacity, and a state-dependent memoryless channel 
connects the source and relay, on one side, and the destination, on the other. The source and 
destination have no state information, while the relay has access to the state information in a 
strictly causal manner. The channel model is shown in Fig. |2] This model is related to the class 
of relay channels, that are not state-dependent, with orthogonal links from the source to the 
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relay and from the source and relay to the destination investigated by El Gamal and Zahedi 
[|2n . In fact, in the scenario under study, we simplify the link from the source to the relay by 
modeling it as a noiseless finite-capacity link, while adding a similar backward relay-to- source 
link. Cooperation as enabled by orthogonal noiseless links, also referred to as conferencing, was 
first introduced by Willems [|22l for a two-user MAC channel and was later extended to several 
settings Il23l - ll25l . It is noted that, in practice, orthogonal links can be realized if nodes are 
connected via a number of different radio interfaces or wired links ll26l . 

As an example, our model fits a downlink communication scenario in a cellular network 
where femtocells are overlaid on a microcell as shown in Fig. [IJ Femtocells are served by home 
base stations, which are typically located around high user-density hot spots, that can serve as 
intermediate nodes or relays between users and the mobile operator network, to provide better 
indoor voice service or data delivery for stationary or low- mobility home users [|27l . The home 
base station is typically connected to the outdoor base station via an out-of-band wired link, e.g., 
a last-mile connection followed by the Internet. The home base station may be able to measure 
the interference created by outdoor users, whereas this may not be possible at the base station or 
at indoor users. This gives rise to the system model we consider in this paper, as can be readily 
observed from Figs. [T]and |2] 

In the considered model, cooperation between source and relay through the conferencing 
links can aim at two distinct goals: i) Message transmission: Through the source-to-relay link, 
the source can provide the relay with some information about the message to be conveyed to 
the destination, thus enabling message cooperation; ii) State transmission: Through the relay-to- 
source link, the relay can provide the source with some information about the state, thus enabling 
cooperative transmission of the state information to the destination. We propose two achievable 
schemes, one based on conventional block Markov coding [[TtI and backward decoding Il28l 
and one inspired by noisy network coding. We show that the latter outperforms the former in 
general. Moreover, based on these achievable rates, we identify capacity results for some special 
cases of the considered model. We also investigate the optimal capacity allocation between the 
source-to-relay and relay-to- source links where the total conferencing capacity is fixed. Finally, 
we derive achievable rates and some capacity results for the Gaussian version of the system at 
hand and elaborate on numerical results. 

The remainder of this paper is organized as follows. Section HI] formally describes the relay 
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Fig. 1. The downlink transmission to a home user in a femtocell provides an example application of the considered model 
illustrated in Fig. |2] The home base station is assumed to be able to measure the interference from outdoor users. 

model considered in this work. Section IlTlillustrates two different achievable coding schemes and 
presents the resulting achievable rates. Section |IV] identifies capacity results for some special 
cases. Section |V] studies the scenario in which the total conferencing capacity is fixed and 
elaborates on optimal capacity allocation. Section |Vl] studies the Gaussian case of our model 
and provides numerical results. Section IVIII concludes the work. 

Throughout the paper the following notation is used. Probability distributions are denoted 
by p subscripted by the random variables involved, e.g., px (x) is the probability of X = x, 
Py\x iy\x) is the conditional probability of Y = y given X = x, etc. We will drop subscripts 
from the probability functions when the meaning is clear from the context, e.g., p (x) stands 
for Px (x). Also X* denotes vector [xi, ...,Xi]. E [X] denotes the expectation of random variable 
X. A/'(0,cr^) denotes a zero-mean Gaussian distribution with variance a^. C{x) is defined as 

C(x) = il0g2(l+X). 

II. System Model 

In this section, we present the channel model and provide relevant definitions. As depicted in 
Fig. [21 we study a three-node relay channel where the source and relay are connected via two 
unidirectional out-of-band orthogonal links of finite capacity, while there is a state-dependent 
memory less channel between the source and relay, on one side, and the destination, on the other. 
Note that the relay transmits and receives simultaneously over two orthogonal channels. 
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Fig. 2. A state-dependent relay channel with two unidirectional out-of-band orthogonal links. 

The channel is characterized by the tuple: 

{X X XR,S,y,p{s) ,p{y \s,x,xr),Csr,Crs) (1) 

with source input alphabet X, relay input alphabet Xr, destination output alphabet y and channel 
state alphabet S. The capacity per channel use of the source-to-relay and relay-to-source out- 
of-band, also known as conferencing [|22l . links are given by Csr, Crs respectively. The state 

n 

sequence is assumed to be i.i.d., i.e., p (s") = H P i^i)- The relay channel is discrete memoryless 

i=l 

(DM) in the sense that at any discrete time i = 1, n, we have 

p{yi\s\x\xR,y'~^) = p{yi\si,Xi,XR^i) . (2) 

We assume that state information is available to the relay in a strictly causal manner while there 
is no state information at the source and destination. 

Definition 1: Let W, uniformly distributed over the set W = [1 : 2"^], be the message sent 
by the source. A (2"^, n) code consists of: 

1) Conferencing codes: Conferencing mappings are defined as 

hsR,i ■■ w X r^-/ ^ rsR,i, (3) 

hRS,^ ■■ S'-' X 7^]^ -> TRS,i, (4) 

where © generates the ith symbol sent on the source-to-relay link based on the message 
and all symbols previously received on the relay-to- source link, while & generates the ith 
symbol sent on the relay-to- source link based on the strictly causal states and all symbols 
previously received on the source-to-relay link. Note that Tsra is the alphabet of the 
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conferencing message sent from the source to the relay, while Trs.j is the alphabet of the 
conferencing message sent from the relay to the source at time instant i, i = 1, n. Such 
mappings are permissible if the following capacity-conserving conditions are satisfied: 

1 " 

-y2^og,\TsR,\<CsR, (5) 

i=\ 
1 " 

Eloga \TrsA < Crs- (6) 

2) Encoder mappings at the source: 

Wxr^5^A'„V^ = l,...,n, (7) 

which generates the channel input at the source at time % based on the message and the 
information received from the relay up to and including time % on the relay-to-source link. 

3) Encoder mappings at the relay: 

/^,, : S'-^ X VsR ^ A'r,.,V^ = (8) 

which generates the channel input at the relay at time % based on the strictly causal state 
information and the information received from the source up to and including time i on 
the source-to-relay link. 

4) Decoder mapping at the destination: 

g-.y^ ^ W, (9) 

which produces the estimate of message at the destination based on the received sequences. 
The average probability of error, Pr(-E'), is defined by: 

Pr(E) = ^^Pr((^(y")7^w;|u; sent). (10) 

A rate R is achievable if there exists a sequence of codes (2"^, n) as defined above such that 
the probability of error Pr(£') — )■ as n — )■ oo. The capacity of this channel is the supremum 
of the set of all achievable rates. 

in. Achievable Schemes and Upper Bound 

In this section, we demonstrate two different coding schemes that exploit the potential benefits 
of message and state cooperation between source and relay. We also identify a simple upper 
bound on the capacity. 
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A. Scheme 1: Block-based Message and State Cooperation 

We first propose an achievable sclieme based on conventional block Markov coding and 
backward decoding. 

Proposition 1: For the DM state-dependent relay channel of Fig. |2l any non-negative rate 
smaller than Ri is achievable where 



Ri = maxmin 

Pi 



^ I{X;Y\Xji,V,U) + CsR, ^ 

nX,XR-Y\V)-nV-S\Y), 
yi{X,Xn;Y\V,U) + CsR + CRs-nV;S\Y) J 



(11) 



with the maximum taken over the distributions in the set of 

Vi = {p{v,u,s,x,XR,y) : p (s) p (v \s) p (u) p (x \u) p {xr\u) p (y \s, X, Xr)} (12) 

subject to the constraint: 

/ {Xr; Y\X,U) + min {Crs, I {X, U; Y)) >I{V;S\Y). (13) 

Sketch of Proof: The idea is to follow a natural block Markov strategy. Specifically, the mes- 
sage w is split by the source into (6 — 1) parts, {wi, Wb~i), wj E [l : 2"^^], j = 1, (6 — 1), 
which are transmitted over b blocks, each block consisting of n channel uses where n = ^, 
and m is the number of total channel uses. At the end of each block, the relay compresses 
the state sequence that has affected the channel over the block with the aim of conveying such 
information to the destination in the next block. Compression exploits the side information at the 
destination via Wyner-Ziv coding [|29l . Conferencing takes place before the beginning of each 
block. Specifically, through conferencing, before the jth block, the source conveys part of the 
message wj to the relay in order to enable message cooperation, while the relay sends part of 
the bin index produced by Wyner-Ziv coding [fTTl . [|29l to the source to enable cooperative state 
transmission. The exchange state and message information is sent cooperatively by the source 
and relay, while the remaining part of the message Wj is sent independently by the source and 
the remaining part of the bin index is sent by the relay alone. This strategy is referred to as 
block-based message and state cooperation. Decoding takes place by backward decoding [|28l . 
Specifically, starting from the last reception, the destination first retrieves the compressed state 
information for block (b — 1). After that, it performs coherent decoding to recover message 
Wb-i from the (6 — l)th block reception by exploiting the state information retrieved. Using the 
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decoded message, the destination turns to retrieve the compressed state information for block 
{b — 2), and then decodes the corresponding message Repeating this operation until back 

to the first block, the destination recovers all the messages over blocks. Details of the proof are 
provided in Appendix |Al ■ 
Remark 1: To interpret (fTTT) to (fT3] ) in light of the transmission strategy discussed above for 
scheme 1, we remark that V represents the compressed state information and U accounts for 
the codeword transmitted cooperatively by the source and relay, which conveys both state and 
message information they share. Bound (fT3] ) imposes that the Wyner-Ziv rate I(y;S\Y) is 
supported by the cooperative transmission of the source and relay, whose rate is limited by 
min {Crs, I {^i U; Y)) and the information sent independently from the relay / (X/j; Y \X, U). 
The mutual information terms in (fTTT ). and in particular the conditioning on V , account for the 
fact that the destination has information about the channel via the compressed state V , which 
allows for partial or complete coherent decoding. Moreover, the second and third term in (fTTT ) 
reflect the cost in terms of rate to be paid for the transmission of compressed state information. 
□ 



B. Scheme 2: Burst Message Cooperation and Block-based State Cooperation 

In this subsection, we propose a second transmission scheme inspired by noisy network coding 

m. 

Proposition 2: For the DM state-dependent relay channel of Fig. |2l any non-negative rate 
smaller than i?2 is achievable where 

^ I{X-Y\Xr,V,U) + Csr, ^ 

I (X, Xr, V;Y)-I {V; S \Xn, U) , (14) 
\^ I{X,Xn,V;Y\U) + CsR + CRs-nV;S\Xn,U) ^ 
with the maximum taken over the distributions in the set of 

V2 = {p{v,u,s,x,XR,y) ■.p{s)p{v \s,XR,u)p{u)p{x \u)p{xr \u)p{y\s,x,XR)} . (15) 



R 



2 = max mm 

P2 



Sketch of Proof: Inspired by the noisy network coding scheme in [|20|. the same message 
w, w E [l : 2"^^^] , is sent at the source over all b blocks of transmission with each consisting 
of n channel uses. Thus, unlike scheme 1 discussed above, here information exchange about 
the message between source and relay takes place only one at the beginning of the first block. 
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This way, the source shares part of the message w with the relay in order to enable message 
cooperation. As for the state, at the end of each block, the relay compresses the state sequence 
over the block without explicit Wyner-Ziv coding, that is, without binning ||20l . Exchange of 
state information between relay and source takes place before the beginning of each block as for 
scheme 1 proposed above. Source and relay cooperatively send the message and state information 
they share, while the source sends the remaining part of the message independently and the relay 
sends the remaining part of the compression index alone for each block. This transmission scheme 
is referred to as burst message cooperation and block-based state cooperation strategy. At the end 
of b blocks of transmission, the destination performs joint decoding over all blocks of reception 
without explicitly decoding the compressed state information as for the noisy network coding 
scheme EOll . Details of the proof are provided in Appendix |B] ■ 
Remark 2: To interpret ([14] ) to (fT5] ) in light of the transmission strategy discussed above and in 
comparison the one in scheme 1, we point out that, as in Remark [H V represents the compressed 
state information while U denotes for the common message and state information. Each mutual 
information term in (fT4]| . in particular the conditioning on V, has for a similar interpretation as 
explained in Remark [T] Unlike scheme 1, however, the compressed state V is generated without 
explicit Wyner-Ziv coding and without requiring correct decoding of the compressed state at the 
receiver. This fact, as detailed in the proof, makes it possible to choose V to be dependent of 
Xji, U and S, instead of only S in scheme 1. Moreover, the rate loss due to the need to convey 
state information can be smaller than / {V; S \ Y) in (fTT)) . as discussed in Proposition [3l Finally, 
since the decoding is implemented jointly without recovering all the compressed states correctly 
in scheme 2, there is no explicit additional constraint (flBT l. □ 

C. Comparison of Achievable Rates 

Based on the discussion above, we expect scheme 2 of Proposition [2] to outperform scheme 
1 of Proposition [T] This is shown by the following proposition. 

Proposition 3: R2 > Ri. 

Proof: We prove the results by showing that the three terms in dHJ are larger or equal 
than the ones in (fTTI) . This, coupled with the fact that the characterization of R2 does not 
have additional constraint (fT3T ) and with the more general distribution p (v \s, xr,u) allowed by 
scheme 2 over scheme 1 (which constrains the distribution as p{v \s)), is enough to conclude 
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the proof. Specifically, setting p{v\s,xn,u) = p{v\s) in V2, we have that: 

1) The first term in ([T4] ) is the same as the first term in ([Til . 

2) The second terms are also equal since 

I (X, Xr, V;Y)-I {V- S \Xr, U) (16) 

= nX,XR;Y\V) + nV;Y)-I {V; S \Xr, U) (17) 

= I{X,Xr-Y\V) + H{V\S)-H{V\Y) (18) 

= I{X,XR-Y\V)-I{y-S\Y), (19) 

where (fTST i is because V is independent of (U^Xr), and (fT9l) follows from the Markov 
chain V ^ S ^Y. 

3) The third term of (fT4] ) is larger or equal than the corresponding term in (fTTT ) since 

/ (X, Xr,V-Y\U) + Csr + Crs-I {V- S \Xr, U) (20) 

= I{X,Xr-Y\V,U) + Csr + Crs + I{V-Y\U)-I{V-S\Xr,U) (21) 

= I{X,Xr-Y\V,U) + Csr + Crs + H{V\S)-H{V\Y,U) (22) 

> J(X,X^;y|\/,[/) + C5ii + CR5 + i/(^|5)-i^(V|V') (23) 

= / (X, x^; r I f/ ) + C5ij + - / 5 1 y ) , (24) 

where (|22] ) is because V is independent of (f/, X/j), (|23l ) holds because conditioning 
reduces entropy, while (|24] | again follows from the Markov chain V ^ S ^ F. 

■ 

D. An Upper Bound 

Here we derive a simple upper bound. 

Proposition 4: For the DM state-dependent relay channel of Fig. |2l the capacity is upper 
bounded by 

Rupp = max min (/ (X, X^; Y) , / (X; F \Xr, S) + Csr) (25) 
with the maximum taken over the distributions in the set of 

T^upp = {p{s,x,XR,y) : p{s)p{x,XR)p{y \s,x,xr)} . (26) 
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Proof: The upper bound (|25T ) is essentially a cut-set bound Il30l . where the first term 
corresponds to the MAC cut between source-relay and destination, and the second is the cut 
between source and relay-destination. Given presence of the state sequence, calculation requires 
some care and is detailed below. 

For the first term, consider a genie-aided system in which the message is also provided to the 
relay and the state s'~^ is also provided to the source at time i. The system can be now seen as 
being point-to-point with inputs {X,Xr), output Y and with strictly causal state information. 
In this case, it is well known that state information does not increase capacity, which is given 
by the first term in (|25T ). The result can also be seen from the Fano' inequality ll30l as 



Rupp<-I{W;Y'') + en (27) 
n 

1 " 

<-Y,l{^-M^'''^)+^n (28) 

i=l 
1 " 

< - ^ ^0 + en (29) 

i=l 

1 " 

< - V/(X„X«,,;FO + en (30) 

1=1 

with e„ — 7- as n — 7- oo, where (|29l ) follows from the non-negativity of mutual information, 
(l30l) follows from the Markov chain (W^, F*"^) ^ (Xj,X/j,j) o Fj. This Markov chain can 
be seen as a consequence of the independence of Si and (VT, F*^^), and the Markov chain 
{W,Y'-')^{X„Xn„S,)^Y,. 

For the second term, consider another genie-aided system in which the perfect state information 
is provided to the destination. Then, by the Fano' inequality [[30l , we have 

Rupp < ^/ {W- Y\ S\ T^«) + e„ (31) 

= ^/(ly;F^T^^|^") + e„ (32) 

= -/ {W; 1^", T^ji) + -I {W; T^^ |^") + e„ (33) 
n n 

with e„ — > as n — > oo, where (132] ) holds because W and are independent and (1331 ) follows 

from the chain rule. Note that 

lnW-TSn\S'') 
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< \h {T^^) (34) 
1 

<-y^H{TsR,d (35) 



n . 
1=1 



< CsR, (36) 



where (1351 ) is due to the fact that conditioning reduces entropy and (1361) follows from the definition 
of permissible conferencing mapping given by Moreover, we have that 

h{W;Y^\S\TSj,) 
1 " 

= -J2H^'^^^ 5", TSn) (37) 

i=l 

= -f2l{W;Yi\Y^~\S\ T^^, X^) (38) 
n ^ — ' 

1 " 

(^^ 1^"'' ' ^b) -H{Y^\ W, Y'~\ 5" , T^"^, X^)] (39) 



n 

i=l 



1 " 

= - ^ [iJ (r, \Y'-\ ^" , T^"^,, X^) -H{Y,\ W, Y'~\ , T^^, X^, T^s, ^")] (40) 
i=i 

1 " 

< - V [i7 (r, - ^ Si)] (41) 

1 " 

= -V/(X,;F,|Xfi,„50, (42) 



n . 
1=1 



where (l38l) holds because ^ is a function of (T|^, S^~^), (|40l) holds because Tr^ j is a function 
of (T^]j^, S^^^) while is a function of (VT, T^^), (|4T]) follows from the memoryless property 
of the channel and the fact that conditioning reduces entropy. Overall, we have 

1 " 

Rupp ^ — / I i^ij Yi |Xr j, Si) + CsR + e„ (43) 
1=1 

with e„ — )■ as n — )■ oo. 

Finally, from (l30t and (1431) . the proof is concluded using the standard approach of introducing 
a time-sharing variable uniformly distributed in the set [1 : n] and then arguing that one can 
set Q to be constant without loss of optimality [|30l Ch.l5]. ■ 
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IV. Special Cases and Capacity Results 

In this section, we consider three special cases of the general model studied above, namely: i) 
No message and state cooperation, in which Csr = Crs = 0; ii) Message cooperation only, in 
which Csr > 0, Crs = 0; Hi) State cooperation only, in which Csr = 0, Crs > 0. We establish 
capacity results for a special class of channels for each case. 

A. No Message and State Cooperation 

Corollary 1: If Csr = Cus = 0, any non-negative rate smaller than R21 is achievable where 

R21 = maxmin (/ (X; Y \Xr,V) , I {X, Xr, V;Y) - I {V- S \Xr)) (44) 
'P21 

with the maximum taken over the distributions in the set of 

'P21 = {p{v,s,x,XR,y) : p (s) p {v \s, xr) p (x) p (xr) p (y \s, X, xr)} . (45) 

Proof: The achievable rate follows from R2 (fT4l) by setting Csr = Crs = and f/ = 0, 
since no information is shared between the source and relay. ■ 

This rate turns out to be optimal, i.e., capacity-achieving, for a special class of relay channels, 
which includes modulo-additive state-dependent relay channels, see Example [IJ 

Proposition 5: Let V21 denote the set of distributions defined by: 

"^21 = {pis,x,XR,y) : p (s) p (x) p (xr) p {y \s, X, xr)} . (46) 

If Csr = Crs = 0, 

H{Y\X,Xr,S) = 0, (47) 

and H {S \X,Xr,Y) = (48) 

are satisfied for all distributions in P21' then the capacity is given by: 

C21 = maxmin {H {Y \Xr, ^) , / (X, Xr; Y)) . (49) 

Proof: The achievability is straightforward by setting V = S and applying assumptions (l47l) 
and (|48l) when evaluating (l44l) . Specifically, we have 

/ (X; Y\Xr,S)=H {Y \Xr,S) , (50) 
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and 

I{X,XR,S;Y)-I{S;S\Xn) (51) 

= / (X, Y) + I {S; Y\X,Xr)-H [S] (52) 

= / (X, Xr- Y)-H{S |X, Xr, Y ) (53) 

= I{X,Xr-Y). (54) 

To obtain a converse result, we follow from (l25T l and note the fact that X and Xr must be 
independent since source and relay cannot cooperate when Csr = Crs = 0- Hence, the capacity 
is upper bounded by (|25T ) evaluated for some product input distribution p {x) p (xr). Overall, we 
have: 

C21 <I{X;Y \Xr, S) = H{Y \Xr, S) , (55) 
C2i<I{X,Xr;Y) (56) 

for some input distribution p{x)p{xr). The proof is concluded by maximizing the mutual 
information terms (1551 ) and (1561 ) over the same input distribution p (x)p (xr). ■ 

Remark 3: Achievability of the capacity (l46l)— (|49l) has been proved above via scheme 2. The 
same capacity result cannot be obtained by setting U = = S in Ri from scheme 1 of 
Proposition [H since we have the additional constraint / (Xr; Y \X) > H (S \ Y). This points to 
the advantage of noisy network coding-like strategy used by scheme 2. □ 

Remark 4: Condition (l47l) basically states that, when fixed X and Xr, there is no other 
source of uncertainty in the observation Y beside the state S. Condition (|48] ). instead, says that 
the state S is perfectly determined when Y, X and Xr are known. These conditions guarantee 
that providing information about the state directly reduces the uncertainty about the input X and 
Xr. The fact that the relay can increase the achievable rate up to / {X, Xr] Y) in (|49l ) can be 
interpreted in light of this fact since the relay signal Xr directly contributes to the achievable 
rate even though the relay is not aware of the message. This will be further discussed in Remark 
[T3] for a Gaussian model. □ 

Example 1: Consider a binary modulo-additive state-dependent relay channel defined by 

Y = X (BXr® S, (57) 
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where S ~ Bernoulli (ps) ■ Let us further impose the cost constraints on the source and relay 
codewords 

- V E [X,] < p, - V E [Xr,] < pr (58) 
n n ^-^ 

i=l 1=1 

with < p,Pr < \- Extending the capacity resuh of Proposition |5] to channels with cost 
constraints is straightforward and leads simply to limiting the set of feasible distributions (|46] | 
by imposing the constraints that E [X] < p and E [Xr\ < pr, see, e.g., OTIl . Therefore the 
capacity is given by: 

Cbin = min {Hi,{p), Hb{p*Pr*Ps) - Hf, (pj) , (59) 

where pi * p2 denotes the discrete convolution operation of two Bernoulli distributions with 
parameters pi and p2, i.e., pi * P2 = Pi {I - P2) + ^2(1 -pi), and Hb{p) = -plogaP - 
(1 -p) log2(l -p). 

As a specific numerical example, setting p = pr = 0.15 and ps = 0.1, we have Cbm = 0.4171. 
Note that without state information at the relay, the channel can be considered as a relay channel 
with reversely degraded components in [fTTI . In this case, the best rate achieved is given by [fTTl 
Theorem 2]: 

Cbin, no SI = max max I {X;Y \Xr = xr) (60) 

p(x) xu 

= H,{p*ps)-H,{ps) (61) 
= 0.2912. (62) 

Hence Cbin > Cbin, no sh which assesses the benefit of state information known at the relay even 
in a strictly causal manner. 

Remark 5: The channel discussed in Example [U has a close relationship with the modulo- 
additive state-dependent relay model considered by Aleksic, Razaghi and Yu in ll32ll . Therein, 
the relay observes a corrupted version of the noise (state) non-causally and has a separate and 
rate-limited digital link to communicate to the destination. For this class of channels, a compress- 
and-forward strategy is devised and shown to achieve capacity. Unlike ll32l . the relay obtains the 
state information noiselessly, strictly causally and the relay-to-destination link is non-orthogonal 
to the source-to-destination link. We have shown in Proposition [5] that in this case, the proposed 
scheme 2 achieves capacity. □ 
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B. Message Cooperation Only 

With Crs = 0, the model at hand is similar to the one studied in [[211 . where capacity was 
obtained for a state-independent channel in which a general noisy channel models the source- 
to-relay link. For this scenario, the optimal coding strategy was found to split the message into 
two parts, one decoded by the relay and sent cooperatively with the source to the destination and 
the other sent directly from the source to the destination. By setting S = V = % and Crs = 
in (fT4l) . we recover a special case of the capacity obtained in [|2T1l with noiseless source-to-relay 
link. 

For state-dependent channels, a general achievable rate can be identified through R2 in (fT4)) 
by setting Crs = 0. Moreover, when the source-to-relay capacity is large enough, we are able to 
characterize the capacity as follows. Notice that this capacity result holds for an arbitrary Crs, 
not necessary Crs = 0. 

Proposition 6: Let V22 denote the set of distributions defined by: 

T^22 = {P (s, X, XR,y) : p (s) p {x, xr) p {y \s, X, xr)} . (63) 

If CsR > max / {X,Xr] Y) and arbitrary Crs, the capacity C22 is given by: 

^22 

C22 = maxJ(X,X^;F), (64) 

''^22 

and is achieved by message cooperation only. 

Proof: When Csr > C22, the source can share a message w of rate C22 with the relay 
through the conferencing link. By setting U = X and V = $ in (O and removing redundant 
bounds, we establish the achievability part. The converse part follows directly from (l25T l. ■ 

Remark 6: The capacity identified above is the same as without any state information at 
the relay. This result implies that when the relay is cognizant of the entire message, message 
transmission always outperforms sending information about the channel states. In other words, no 
benefits can be reaped if the relay allocates part of its transmission resources to state forwarding. 
This can be seen as a consequence of the fact that in a point-to-point channel, no gain is possible 
by exploiting availability of strictly causal state information. □ 

Remark 7: The capacity result of Proposition |6] has been proved by using scheme 2 for 
achievability. However, it can also be obtained with scheme 1 of Proposition [T] by setting U = X 
and V = This may not be surprising since the two schemes differ most notably in the way 
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State information is processed at encoder and decoder, and the capacity result of Proposition |6] 
is achieved with full message cooperation. □ 

C. State Cooperation Only 

If CsR = 0, no cooperative message transmission is allowed. However, through the confer- 
encing link of capacity Cbs, cooperative state transmission between the relay and source is 
still feasible. A general achievable rate can be identified from R2 in (fT4)) by setting Csr = 0. 
Specifically, when Crs is large enough, we have the following corollary. 

Corollary 2: Let P23 denote the set of distributions defined by: 

^23 = {p{s,v,x,XR,y) ■.p{s)p{v\s,XR)p{x,XR)p{y\s,x,XR)}. (65) 

If Csr = and Crs > max/ (X^j; F), any non-negative rate smaller than i?23 is achievable 
where 

i?23 = maxmin (/ (X; Y\Xr,V)J (X, Xr, V-Y) - I {V- S \Xr)) . (66) 

'P23 

Proof: By setting Csr = and U = Xr in (fT4l) . the set of V2 is specialized to 7^23- Fix any 
input distribution in 7^23- The first term in the min function of (fT4)) is reduced to I {X; Y \Xr, V) 
and the second term is reduced to / (X, X^, V;Y) — I {V; S \Xr). For the third term, it becomes: 

/ (X, Xr, V- Y\Xr) + Crs -I{V;S \Xr) (67) 

= / (X, V; Y\Xr) + Crs -nV;S \Xr ) (68) 

> I (X, V;Y\Xr) + I {Xr; Y) - I {V; S \Xr) (69) 

= / (X, X^, V;Y)-I {V; S\Xr), (70) 

where the inequality (|69] ) follows from the assumption on Crs- Notice that the third term cannot 
be smaller than the second term, hence it is redundant. Therefore, we establish the achievable 
rate given by (l66l) . ■ 

The achievable rate (|66l) coincides with the upper bound (|25T ) for the special class of relay 
channels characterized by (|47T)— (l48l). 

Proposition 7: Let ^23 = '^23 as defined by dMl)- If Csr = 0, Crs > maxi {Xr;Y), and 

^23 

(l47l)— (l48l) are satisfied for all distributions in V23, then the capacity is given by: 

C23 = maxmin {H {Y \Xr,S) , / (X, Xr; Y)) . (71) 

^23 
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Proof: The result follows from Corollary |2] For the achievability, set = S* in (|66l) and 
apply assumptions (l47l) and (l48l) to obtain (TtTI) . by similar steps from (ISOl ) to (|54|) . The upper 
bounds follow from (|25T ) and note that the second bound therein is reduced to H {Y \Xr,S) 
under assumption (|47T l. ■ 

Remark 8: Achievability of the capacity (TtTI) has been proved above via scheme 2. It cannot 
be attained by scheme 1 because of the additional constraint required to support the transmission 
of compressed state information specified by (fTST ). □ 

Remark 9: Compared to the capacity result provided in Proposition |5] for the same class of 
channels (|47l)— (|48l). C23 is potentially larger because a general input distribution is admissible 
instead of the product input distribution due to state cooperation. The resulting cooperative gain 
will be further discussed for the Gaussian model in Section IVH □ 

Remark 10: The capacity result of Proposition |7] is derived for Csr = and is thus achieved 
by state cooperation only. Optimality of state cooperation only can also be concluded in some 
case when Csr > and thus message cooperation is possible. For instance, assume that 
H{Y\ Xr,S) > I{X,Xr;Y) for the distribution in V23 that maximizes dlB- Then it can 
be proved, following the same bounds used in Proposition Ul that the capacity of channels 
satisfying (l47l)— (l48l) is given by (l64l) and is achieved by state cooperation only. An instance of 
this scenario will be considered in Corollary @] □ 

V. Cooperation Strategies With Total Conferencing Capacity Fixed 

In the previous sections, we have studied system performance for given values of the link 
capacities Csr and Crs- Here, we briefly investigate the optimal capacity allocation between 
the source-to-relay and relay-to- source links where the total conferencing capacity is instead 
fixed as Csr^ + Crs = Csum- In particular, we compare the rates achievable when the entire 
capacity is allocated to message cooperation only {Csr = Cgum and Crs = 0), to state cooper- 
ation only {Csr = and Crs = Csum), or to a combination of message and state cooperation 
{Csr = Copt > and Crs = Csum — Copt)- We refer the achievable rates corresponding in the 
three cases above by scheme 2 as R2,m, R2,s and R2,ms respectively. 

Proposition 8: For scheme 2, when Csr + Crs = Csum is fixed, we have 

-^2,5" < R2,MS < R2,M- (72) 
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Proof: Fix any input distribution of the form (fTST ) in R2 of (fT4l) . The second term in the 
min function of (fT4l) is independent of both Csr and C/j^, hence it is independent of Cgum- The 
third term, when Csr + Crs = Csum is fixed, is the same no matter how one allocates Cgum 
between Csr and Crs- Finally, the first term increases with Csr- Since Csr cannot be greater 
than Csum, it is optimal to set Csr = Csum- It follows that R2,s < -^2,1/5 < ^2,m- ■ 
Remark 11: For scheme 2, it is optimal to allocate all conferencing resources for message 
forwarding, thereby leading to message cooperation only. In other words, state cooperation is 
generally not advantageous when utilizing this scheme if one can arbitrarily allocate the overall 
conferencing capacity. Notice that this may not be always possible, as for instance, in applications 
where the two conferencing links are unidirectional channels with fixed capacity, e.g., cables. 
Assessing a similar conclusion holds for scheme 1 seems to be more difficult and is left as an 
open problem. □ 

VI. Gaussian Model 

In this section, we study the Gaussian model depicted in Fig. |2l in which the destination 
output Yi at time instant i is related to the channel input Xi from the source, Xr^^ from the 
relay, and the channel state Si as 

Yi = + Xjii + Si + Zi, (73) 

where Si ~ J\f{0,Ps) and Zi ~ J\f(0,NQ), are i.i.d., mutually independent sequences. The 
channel inputs from the source and relay satisfy the following average power constraints 

Ij:^[x!]<p, Ij:^[xu<pr. (74) 

i=l i=l 

The conferencing operations, encoding and decoding functions are defined as in Definition [U 
except that the codewords are required to guarantee the input power constraints (|74] |. 

A. Achievable Rate 

First, we extend the rate dH) achievable by scheme 1 to the Gaussian model of (1731)— (1741). 
Proposition 9: For the Gaussian relay channel considered, scheme 1 achieves any non-negative 
rate smaller than Rf where 

Rf = max mm{Ai,A2,A3) (75) 

0<a<l 
0<^<1 

<^<Pq 
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with 



(1 -a)P 



^ (l-a)P+(l-/3)Pfi \ _^/^ (P + P^ + 2vWP^ + iVo)Ps 



+ + (78) 

where a, (3 are the power allocation coefficients at the source and relay respectively, Pq is the 
variance of compression noise selected at the relay and a is a threshold defined as: 

P5(P + P,, + 2Vc./jPPK + iVo) 

{P + Pn + 2./WPP-R + Ps + No) min (2^^^^ (l + - 1, '^'%':^ -) ' 

(79) 

Sketch of Proof: The result follows from (fTH— (fT3l) by choosing Gaussian input signals 
satisfying the power constraints. Explicitly, the signals are generated as follows. First, choose 
f/ ~ A/" (0, 1). Then, consider X = ^/^U + X, where < a < 1 and X ~ A/" (0, (1 - a) P), 
independent of U. Hence, X ~ TV (0, P). Similarly, set Xr = ^/WrU + Xr, where < /3 < 1 
and ~ A/" (0, (1 - /3) Pr), independent of and X. Hence X^^ ~ A/" (0, Pr) and E [XXr] = 
^Ja|3PPR. Next, set V = S* + Q with compression noise Q ~ A/" (0, Pg) for some Pq > cr. By 
standard techniques as in ll30l Ch.8 and 9], each mutual information term in (fTT)) and (fTJt can 
be explicitly evaluated, establishing the achievable rate given from (l75l) to (|79l ). ■ 

Next, we extend the rate ([14] ) achievable by scheme 2 to the Gaussian model of (|73])— (|74|). 

Proposition 10: For the Gaussian relay channel considered, scheme 2 achieves any non- 
negative rate smaller than P^ where 

= max min (Pi, P2, 53) (80) 



with 



0<Q<1 

0</3<l 
0<Po 
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^2 = 2 log2 
^3 = ^ log2 



P + Pr + 2^af3PPR + Ps + No 




(82) 




(l-a)P+(l-/3)P^ + P5 + iV( 




(83) 



where a, (3 are the power allocation coefficients at the source and relay respectively and Pq is 
the variance of compression noise selected at the relay. 

Sketch of Proof: We use the same variable definitions as in the proof of Proposition [9] with 
exception that Pq only needs to satisfy Pq > 0. Then we can explicitly evaluate each mutual 
information term in (fT4] | following standard techniques in [[30l Ch.8 and 9]. Details are omitted 
here for the sake of conciseness. ■ 
Remark 12: If the relay ignores the available state information, it only cooperates with the 
source in sending the message information and does not employ the relay-to-source conferencing 
link. An achievable rate corresponding to this situation can be found from (l80l)— (f83l) by setting 
Pq —7- oo, i.e., an infinite variance for the compression of the state information, and (3 = 1, i.e., 
the relay allocates all its power to message transmission. We thus obtain 



Notice that the rate is clearly independent of Crs- This rate will be later used for performance 
comparison. □ 

B. Special Cases and Capacity Results 

Now we focus on the special case where A'^o = for the Gaussian model of (l73Tl— (l74l). We 
first consider the case with no both message and state cooperation. 

Corollary 3: If Nq = and the conferencing links satisfy Csr = Crs = 0, the capacity is 
given by: 



Proof: Notice that the channel discussed here satisfies assumptions (|47])— (|48]) in Proposition 
[5l Hence, by extending the results therein to continuous alphabets and evaluating each term by 
the maximum entropy theorem [30], one can obtain the result claimed in this corollary. Note 
that when providing both S and to the destination, the channel from source to destination 




(84) 




(85) 
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is noiseless and hence the first bound in the min function of (|49l ) goes to infinity, and is thus 
redundant. ■ 

Remark 13: The capacity result indicates that strictly causal state information at the relay can 
provide power gain for the channel considered, even though the relay knows nothing about the 
message information intended for destination from the source. In fact, when Nq = 0, conveying 
state information from the relay to destination can be considered as equivalently sending part of 
message for the source, as previously discussed in Remark |4] 

To elaborate on this insight further, we sketch an alternative achievable scheme in which 
we explicitly split the message W from the source into two parts, W = (^^51,^^,2), with 
Wsi G [1 : 2"-^»i] and Ws2 e [l : 2"^=2j_ divide interval [-1,1] into 2""^'^ subintervals of 
equal length and map Wsi to the middle points, denoted by 9{Wsi), of those subintervals. In 
addition, we generate 2"^^^ i.i.d. sequences a;" with each component satisfying Xi ~ J\f (0,P), 
and map Ws2 to the sequences generated as {Ws2)- Assume that the source wishes to send 
{wsi,Ws2) to the destination. The communication happens in (n + 1) channel uses as follows. 
In the first channel use, the source sends out the middle point 9 (wgi) corresponding to message 
Wsi while the relay sends xr^i = 0. For the remaining n channel uses, the source sends out 
each component of x" (^^2) in order. While, for the relay, in the second channel use, it sends 
out a scaled version of the state of the previous channel use such that the power constraint is 
satisfied at the relay, i.e., xr^2 = At2Si, where /i2 is chosen such that xr^2 ~ A/'(0,Pr); For 
i > 3 channel uses, the relay sequentially forms the minimum mean squared error (MMSE) 
estimate E [si 1^2^^] with each = XR^k + s^, ^ k = 2, — 1, based on the available states 
s*"^ and sends out XR^i = Ui (si — E \si ""^j), where /ij is chosen such that XR^i ~ A/'(0,Pr). 
This way, at the end of transmission, the destination first decodes message Ws2 by treating 
the states and information sent by the relay as noise. Hence, as long as Rs2 < C ^ p^p^ j » 
Ws2 can be successfully recovered as n — 00. After decoding ^^2, subtracting {ws2) from 
the received signal, similar to the analysis of the feedback strategy for point-to-point additive 
Gaussian channels in [[3T1 . Il33l . one can show that Wsi can be successfully decoded at rate 
Rsi = C (^p^j by the state refinement transmission from the relay as — t- 00. Overall, rate 
R,, + Rs2 = C (g) + C [p^) = C = eg ,„„p is thus achieved for the source. It is 

noted that a similar feedback coding scheme can be found in flU to achieve the maximum rate 
for each user in a two-user MAC with common state information. □ 
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Next, we consider the optimality of state and message cooperation only following Proposition 
[6] and in 

Corollary 4: If iVo = and the conferencing links satisfy Cbs > C j^fiEag^^ j with 
arbitrary Csr, the capacity is given by: 

/P + P. + 2V7^A ,86) 



and is achieved by state cooperation only. Moreover, if A'^o = and the conferencing links satisfy 
CsB^ > C**^ with arbitrary Crs, the capacity is also given by (l86l) . and is attained by message 
cooperation only. 

Remark 14: Example 1 in dH implies that, if the source knows the state sequence as well, 
then the maximum rate is given by (l86l) . Corollary |4]then quantifies the minimum capacity Crs 
necessary for this result to be attained on the relay channel of Fig. |2] where the source is not 
given the state sequence. □ 

Proof: To prove achievability for the case when C^s > C^, we consider a scheme that uses 
only the relay-to-source conferencing link and perform no message cooperation so that we can 
equivalently set Csr = 0. Then, we can identify the result from Proposition |7]by simple extension 
to continuous alphabets and maximizing each term by the maximum entropy theorem [|30l . 
Alternatively, considering the achievable rate (|80T l by scheme 2 and setting Nq = 0,Csr = 0, 
we rewrite Bi to in the min function as follows: 

\ Ps+Pq / 

B'^ = c(^±f^^^^^), (88) 



P. 



s 



Ps 

Further, setting a ^ 1, (3 ^ 1 and Pq — )• such that B[ — )■ oo, and under the assumption that 
Crs > C (^P±PR+hlPPE'^ , we thus get = B'^ = C (^f±Mv™ j p^j. converse part, the 



upper bound (l25l) reduces to C"^ following from the maximum entropy theorem [|30l . 

Turning to the case when Csr > C^, for the achievable scheme, the relay simply ignores the 
state information, so that one can equivalently set Crs = 0, and fully cooperates with the source 
to transmit the message, so that one achieves rate (|84] ) with a = 1, which reduces to (l86l) under 
the given condition for Csr- ■ 
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From Corollary |4] we immediately have the following. 

Corollary 5: If Nq = 0, and both Crs and Csr are large enough, both state and message 
cooperation only are optimal and achieve the full cooperation bound (|86l) . Compared to the case 
without any cooperation of (1851 ). they both provide cooperative gain. 

Remark 15: If Csr is large enough, e.g., Csr > C^, scheme 1 can also achieve capacity, 
which is attained by setting Pg — oo and /3 = 1 in (l75l) similar to Remark \T2\ However, no 
matter how large Crs is, scheme 1 cannot achieve capacity if Csr = 0. This can be argued 
by considering the extreme case with Crs — > oo. Examining rate (l75l)— (|79l) of scheme 1, we 
notice that the third term in the min function is redundant due to Crs oo. With A'^o = and 
Csr = 0, the first two terms can be instead rewritten as 

^'i=C|^i^|. (90) 

Ps+Pq 

along with an additional constraint 

Pq> (y= !^ • (92) 

P + PR + 2^apPPR + Ps 

To achieve capacity C^, we need to set « — )■ 1, /3 — )■ 1 and Pq — )■ as discussed in Corollary |4] 
But, notice that Pq is always bounded below by a nonzero threshold, which implies that Pg — t- 
cannot be satisfied. Therefore, it can be concluded that scheme 1 cannot achieve capacity by state 
cooperation only no matter how large Crs is. Recall that in scheme 1, the additional constraint 
comes from the fact that the destination needs to decode the compressed state explicitly, as 
discussed in Remark [U Compared to this scheme, the advantages of scheme 2 come from joint 
decoding of message and compression indices. □ 



4 = C(^±^L±V^1, (91) 



C. Numerical Results and Discussions 

We now present some numerical results. We start from the special case with Nq = Q studied 
in Corollary |4] We first compare the performance of scheme 1 and scheme 2 for message 
cooperation only, i.e., Crs = 0. In Fig. [3l we plot the achievable rates versus conferencing 
capacity Csr- We also plot the rate R^^ gj in (f84l) that is achieved when the relay does not use 
the available side information. It can be seen that scheme 2 outperforms scheme 1 in general. 
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Fig. 3. Comparison of achievable rates between scheme 1 and scheme 2 for message cooperation only [Crs = 0, P = Pj? = 
Ps = l,iVo =0). 

consistently with Proposition [3l Moreover, if Csr is large enough, both schemes achieve the 
upper bound (l86l) and the optimal strategy is to let the relay ignore the state information as 
provided in Corollary |4l But this strategy is suboptimal for smaller Csb- The benefits of state 
transmission from the relay to the destination are thus clear from this example. 

Next, we consider state cooperation only, that is, Csr = 0, and compare the achievable rates 
for two schemes in Fig. |4] with the upper bound (1861 ). We also plot the achievable rate ^^^^ 
in (|85l) that is attained when the source transmits message only. The benefits of cooperative state 
transmission by the source are clear from the figure. Moreover, if Crs is large enough, scheme 
2 is seen to achieve the upper bound, as proved in Corollary S] Instead, scheme 1 cannot, as 
discussed in Remark \T5\ 

We now get further insights into system performance by letting Nq ^ 0. We set P = Pr = 
Ps = I and vary A^o such that the resulting signal-to-noise ratio, or interfered state-to-noise 
ratio, 7 = lOlogio (1/A^o) lies between [-5 : 30] dB. 

We focus on scheme 2 and consider message cooperation only, i.e., Crs = 0. Fig. [5] shows 
the rates achievable by scheme 2 and by the same scheme when the relay ignores the state 
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information (|84] ) versus 7. It can be seen that in general state transmission from the relay can 
provide rate improvement, as also shown in Fig. |3] With Csr increasing, the achievable rate 
increases until it saturates at the upper bound (|25T ) when Csb^ is large enough. For example, as 
shown in Fig. [51 when Csr = 1-2, the achievable rate overlaps with the upper bound. 

We now consider state cooperation only, that is, Csr = 0. Fig. [6] shows the rate achievable 
by scheme 2. The upper bound therein also refers to (l25l . It can be seen that cooperative state 
transmission by the source is general advantageous, as compared to the performance without 
cooperation, i.e., Crs = 0. However, unlike the case of message cooperation only, even if Crs 
is large enough, e.g., Crs = 100 in Fig. |6l the upper bound is not achievable in general. This is 
unlike the noiseless case studied in Fig. |4l due to the fact that noise makes the state information 
at the destination less valuable (see Remark l4l and [T3l). 

Finally, we consider the case when the total conferencing capacity Csr^ + Crs = Csum is 
fixed as discussed in Section |Vl Under this assumption, we have shown in Section |V] that for 
scheme 2, it is enough to devote all the capacity for message conferencing, thereby leading to 
message cooperation only. We corroborate this analytical result via a specific example in Fig. Ul 
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Dash line: relay transmits message only, 
Solid line: scheme 2, r| 
Line with circle: upper bound, R 



7(dB) 



Fig. 5. Comparison of achievable rates for message cooperation only by scheme 2 (Csr = {0.2, 0.5, 0.8, 1.2} , Crs = 0,P 
Pij = Ps = 1, 7 = 10 logio (dB) ) . 




Fig. 6. Comparison of achievable rates for state cooperation only by scheme 2 [Csr ~ 0, Crs ~ {0, 0.2, 0.5, 0.8, 100} , P 
PR = Ps = ln = lOlogio {l/No) (dB)). 



It can be seen that a combination of both message and state cooperation is able to provide 
rate improvements as compared to cooperation on state only, while message cooperation only is 
always optimal. 
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Fig. 7. Comparison of achievable rates for different cooperation strategies when the total conferencing capacity is fixed 

(C,™ = l,P = PR = Ps = l,'y = lOlogio (l/No) {dB)). 

VII. Conclusion 

In this work, we have focused a state-dependent relay channel where state information is 
available at the relay in a strictly causal fashion. Assuming that source and relay can communi- 
cate via conferencing links, cooperation is enabled for both transmission of message and state 
information to the destination. First, we have proposed two coding schemes that exploit both 
message and state cooperation. The coding scheme inspired by noisy network coding outperforms 
the more conventional strategy based on block Markov coding and backward decoding. Next, 
capacity results have been established for some special cases, including no cooperation, message 
cooperation only and state cooperation only for a class of channels. We have also elaborated 
on the issue of optimal capacity allocation between the source-to-relay and relay-to-source 
conferencing links. Finally, we have characterized achievable rates for the Gaussian model and 
obtained some capacity results. In general, our results point to the advantage of state information 
at the relay, despite it being known only strictly causally. This is unlike point-to-point channels. 
Moreover, for given conferencing capacities, both state and message cooperation can improve 
the achievable rate. 
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Appendix A 
Proof of Proposition [H 

Throughout the proof, for a joint probability distribution p (xy), the set of e-typical n-sequences 
according to p (xy) is denoted by A" (XY). When the distribution, with respect to which typical 
sequences are defined, is clear from the context, we will use A" for short. 

Now we present the achievable scheme. Consider b blocks of transmission. We randomly and 
independently generate codebooks for each block. 

• Codebook Generation: 
Fix a joint distribution 

p{s,v,u,x,XR,y) =p{s)p{v \s)p{u)p{x\u)p{xR \u)p{y \s,x,xr). 

Define rates R = Rc + Rp with < Rc < min {R, Csr), and R = Rc + Rp with < < 

min (^R, Crs^ ■ 

1) For each block j, j G [1 : &], generate 2"^^' i.i.d. sequences according to the 

n 

marginal probability mass function (PMF) p [v^^] = H p{vj,i) for the given p{v). 

i=l 

Index them as v"^ (Ij) with Ij G [l : 2"^"] . First partition the set [l : 2"-^"] into 2"-^= 
superbins of equal size with each containing 2'^(^^'~^'^) v'^ (Zj) codewords. Then further 
partition the codewords in each superbin into 2"^^ bins of equal size. Then each bin 
contains 2"(^^'~^) codewords. Index each superbin as Bgj {tc,j) while index each bin 
as {tc,j,tpj) with t^j G 1 : 2"^- , tpj G 1 : 2"-^^^ . 

2) For each block j, generate 2^^^"'^^"^ i.i.d. sequences n" according to p (u") = 

n 

Y[p{^j,i) for the given p{u). Index them as {wcj,tcj-i) with Wcj G [l : 2"-^'=] 



and tcj-i 



3) For each block j, for each m" {wcj, tcj-i), generate 2"^^ i.i.d. sequences x^ according 

n 



to the conditional PMF p (x" {wc,j-,tc,j-i)) = 11 P{^j,i (^c,j, ^cj-i) ) for the 
given p{x \u). Index them as x" {wp^ \wcj, tc,j~i) with Wpj G [l : 2"-^p]. 
4) For each block j, for each u" {wcj, tc,j-i), generate 2"^^ i.i.d. sequences ^ according 

n 

to the conditional PMF p {xrj (uicj, tc,j-i) ) = Y[P i^R,j,i i^c,j, ^cj-i) ) for 

i=i 

the given p {xr \u). Index them as (tpj-i |wc,j, ^cj-i) with tpj-i G 1 : 2"-^'' . 
Encoding: 

At the beginning of each block, through conferencing link Csr, the common message Wcj 



31 



can be perfectly conveyed to the relay as long as bnRc < hnCsR, which implies that 

Rc < CsR. (93) 

Similarly, the superbin index can be delivered to the source as long as 

Rc < Crs- (94) 

Then we have the following encoding operations: 

1) j = 1: To send wi = {wc,i,Wp^i) to the destination, the source sends out codeword 
^1 ("^P,! \'^c,i, 1) while the relay sends out codeword x]^ i (1 \wc,i, 1). 

2) j = [2 : b — 1]: Assume Wj = (wcj.Wpj) to be sent in the jth block. At the end of 
the (j — l)th block, the relay learns the entire state sequence, i.e., and looks for 
an index (compression index) such that (/j-i)) G A". If more than 
one such indices are found, choose the smallest one. If there is no such an index, 
choose an arbitrary index at random from [l : 2"^""]. Let (tcj-i, ^pj-i) be the bin 
index pair associated with t;"_^(/j_i). Then codeword {wpj \wc,j,tcj-i) is sent out 
by the source and codeword x^^ {tpj \ wcj,tcj^i) is sent out by the relay. 

3) j = b: No new message is sent at the source. Hence, the source sends out codeword 

(1 \l,tc,b-i) while the relay sends out codeword x^^ (ip,fe-i \^,tc,b-i)- 
• Decoding: 

At the end of b blocks of transmission, the destination performs backward decoding. It first 
retrieves the bin index pair (tc,b-i5 ^p,6-i) through reception of 6th block, then it decodes the 
compression index Ib-i by using the received signal y'^_-^ and finally it decodes the message 
(wc,b-i, for block {b — 1) using the compressed state information This 

decoding operation is repeated for all blocks back to the first. 

Specifically, the decoding procedure for message (wcj, Wpj) of block j is as follows. Assume 
that (uJcj+i, w^pj+i) are perfectly decoded from the previous estimate. Now the destination 
looks for an unique bin index pair {ic,j,ipj) such that 

i^Pd+i \wc,j+i, ic,j ) , {ipj \wcj+i, Lj ) , y^+i {wcj+i, icj) , yj+i) e (95) 

If there is none or more than one such bin index pairs found, the destination reports an 
error Once it finds such a {tcj,tpj), it looks for an unique compression index Ij such that 

(^"(^.•)'^;) e^r, (96) 
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and 



(97) 



If there is none or more than one such compression indices found, the destination reports an 



error. Once it finds such a Ij, the destination looks for an unique message Wj 
such that 



{x] {wpj \wc,j, tcj-1 ) , x%j {tpj.i \wcj, tcj-i ) , u] {wcj, tcj-i) , Vj (Ij) , y]) G (98) 



for some tc,i-i G 



1 : 2 



nRc 



Analysis of Probability of Error: 

Let Pr (Ej) denote the average probability of error for each block j as defined in (flOl) . To 
bound the overall probability of error, say Pe, without loss of generality (WLOG), assume 
{wcj, Wpj) = (1, 1) are sent for each block j. Also denote the compression index selected 
by the relay for each block by Lj^i and the corresponding bin index pair for each block 
by {Tcj-i,Tpj_i). Note that, following the chain rule, 



Pr 



Kj=l 



6-1 



<Pr (^b) + 5^Pr [E^ 



i=j+i 



(99) 



(100) 



where since there is no new message sent in the last block, we have Pr (Eh) = 0. In the 
following, we focus on Pr Ej f] E^ \, i.e., the probability of error conditioned on not 
having errors in block j + l,...,b for each block j, j = 1, — 1, and we show that 
PtIej f] eA ^ as n ^ oo if conditions (fTTSl) . (fn7])-(fTT9l). and (fT23l)-(fT25l) are 
satisfied. 

Define the encoding error event for each block as follows: 

E,,o = { {v] (l,) , s]) i V G [1 : 2"«^] } . (101) 
The error events correspond to decoding T^j and Tp j based on rule (|95] ) are given by: 

^.-1 = ^lll U ^^.12 U i^^,,13 U (102) 
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with 



E. 



i,i2 



E. 



i,i3 



for some tc j 7^ j 

for some tpj ^ Tpj 

for some tc,j 7^ ^cj, tpj- ^ Tpj 
The error events correspond to decoding Lj according to rule (|96] )— (|97l) are given by 

with 



E. 



i,i4 



(103) 
(104) 

(105) 

(106) 



■,22 



(107) 



E,,2i = {{vJiL,),y-)eA:}, (108) 
^j,22 = {(t^; eA^forsome I, ^ L,,l, e B,{T,^,,Tp^^)] . (109) 

The error events correspond to decoding message {wcj,Wpj) according to rule (|98T ) are 
given by: 

Ej,3 = Ej^3i [J Ej;s2 U Ej^3 



',33 



(110) 



with 



^.7,31 = <i 



Xj (1 Iw^c.j) tcj—l ) , Xflj (^P,j — 1 l^cj'; tcj — l ) 



for some 7^ l,tcj-i £ 



^i,32 = < 



for some Wpj 7^ l,tcj-i G 



^.7,33 = < 



(111) 



(112) 



>. (113) 



for some Wpj 7^ 1, Wcj 7^ 1, tcj-i £ 
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Hence by the union bound, 



Pr I Ej 



fl eA < Pr (i?,.o) + Pr (i?,, + Pr (i?,,^) + Pr (i?,,3) • (114) 
i=j+i ) 

1) By the covering lemma in OTI . Pr (-Ej,o) — )■ as long as 

R^>I{y;S) (115) 

for sufficiently large n. 

2) By the packing lemma in [31], Pr (-Ej,!) — )■ as long as 

R,<l[X,Xn.U-Y), (116) 
i?p</(X^-F|X,f/), (117) 
R,^R,<l{X,Xn.\J-Y) (118) 

for sufficiently large n. Note that bound dllSI) implies (11161) , hence (11161) is redundant. 

3) Pr (-E'j,2) as long as 

R^-R<l{y;Y) (119) 

for sufficiently large n. 

4) For Pr {Ej -^), following from the standard argument on joint typicality [[301 Theorem 
15.2.1], for each set of error events from ([1 1 1) to ([1 131 i. we have: 

Pr (E,-3i) < 2"«=2"(^=+^^)2-'^(^(^'^«'^'^l^)-^), (120) 
Pr(Ej32) < 2"^f (2^"('^(^''^l'^«'^'^)"'^ _^ 2"^p2""'^'^^^''^«''^l^'^^"^) 

_^ 2'^(-^^<:+^p)2^"('f(^''^«'^;'^l^)-^)), (121) 

Pr(£'j-33) < 2"(^'^+^p)2"(^'=+^^)2~"(-^(^'^-f*''^''*'l^)"') (122) 
for arbitrary e > and sufficiently large n. 

Note that / (X, Xr,U;Y\V) = I {X, Xr;Y\V) because U ^ (X, Xr, V) ^ Y 

3 

forms a Markov chain. Thus, Pr (Ej ^) < ^ Pr (Ej^^k) — t- as long as 

k=l 

Rp<nX-Y\XR,U,V), (123) 
Rp + Rp<IiX,Xn;Y\U,V), (124) 
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Rc + Rp + Rc + Rp<I {X, Xr-Y\V) (125) 

for sufficiently large n. 

Therefore if bounds (fTTsT) . (fn7]|-(fn9l). and (fT23l)-(fT25]) are satisfied, 



Pr I Ej 



b 



f] -Ef j — 7- for all j = 1, 6 — 1 and for sufficiently large n. 

i=j+i J 

Collecting bounds (fTTSl) . (fTT7l)-(fTT9l). and (fT23l)-(fT25l). along with (HI, dH), i? = R^+Rp, 
R = Rc + Rp, applying Fourier-Motzkin elimination llBTl Appendix D], and exploiting the 
fact that V ^ S -H- Y forms a Markov chain, we establish the achievable rate given by 
([IB-CS- 

Appendix B 
Proof of Proposition [2] 

Consider b blocks of transmission. We randomly and independently generate codebooks for 
each block. 

• Codebook Generation: 
Fix a joint distribution 

p{s,v,u,x,XR,y) =p{s)p{v \s,xr,u)p{u)p{x \u)p{xr \u)p{y \s,x,Xr). 

Define rates R = Rc + Rp with < Rc < min (i?, Csr), and Ry = Rc + Rp with < Rc < 

min {R^,Crs). 

1) For each block j, j G [1 : b], generate 2"(''^'="'"^'=) i.i.d. sequences according to 

n 

P {u") = Y[pi'^j,i) for the given p{u). Index them as {wc,tcj-i) with Wc G 

i=l 

[1 : 2"^^-] and G 1 : 2"-^^= . 

2) For each block j, for each m" (wc, ^cj-i), generate 2"''^p i.i.d. sequences according 

n 

to the conditional PMF p (x" = ^ p for the given p{x\u). Index them 

i=l 

as x" (wp |wc, tcj-i ) with Wp e [l : . 

3) For each block j, for each {wc, tc,j-i), generate 2"^*' i.i.d. sequences x^ ^ according 

n 

to the conditional PMF p {xrj \u") = Y[p i^RjA for the given p {xr \u). Index 

i=l 

them as Xr^ (tpj-i |ii;c,tcj-i) with tpj-i G 1 : 2"^^ . 

4) For each block j, for each (x^^ (tpj-i |tfc, ^cj-i ) , {uic, tc,j-i)), generate 2"^^" i.i.d. 
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sequences according to the conditional marginal PMF 



1=1 



for the given p{v\xr,u). Index them as v"- {tcj,tpj \tc,j-i,tpj-i,Wc) with tcj G 



1 : 2 



nRc 



and tpj E 



^ . 2'nRp 



Encoding: 

The source wishes to send the same message w = (wc, Wp) to the destination over all the 
blocks. At the beginning of the first block, through conferencing link Csr, the common 
message Wc can be perfectly conveyed to the relay as long as bnRc < bnCsR, which implies 
that 



Rr < C: 



SR- 



nRc 



(126) 



selected at the relay can always 



Similarly, the partial compression index tcj-i G 1:2 
be delivered to the source through the conferencing link Crs for each jth block as long as 

(127) 



Rc < Crs- 

Then we have the following encoding operations: 

1) j = 1: The source sends out {wp \wc, 1) while the relay sends out a;^ ^ (1 \wc, 1 ). 

2) j = [2 : b]: At the end of block (j — 1), the relay learns the entire state sequence, i.e., 
s^_i, and looks for a compression codeword associated with index (tc,j-i, ^pj-i) 
such that 



'j'—l, ^j^-l (^cj-l) tp,j~l |^c,j-2) tp,j~2: "^c) ) 



e a:. 



If more than one codewords are found, choose the first one in the list. If there 
is no such a codeword, choose an arbitrary one at random from the compression 
codebook. Then codeword x" {wp \wc,tcj-i) is sent out by the source and codeword 
^R,j Iwctcj-i) is sent out by the relay. 

Decoding: 

At the end of b blocks of transmission, the destination performs joint decoding over all 
blocks by looking for an unique message w = {wc, Wp) with Wc E [l : 2"^^^^] and Wp E 
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[1 : 2"^^p] such that: 



for all j = 1, 6 and some = (ti, ta, t;,) = {tc,i, tp,i, tc,2, ip,2, ^c,6, ^p.ft)- 
Analysis of Probability of Error: 

To bound the probability of error Fr{E), WLOG, assume {wc, Wp) = (1, 1) are sent 
blocks. Also denote the indices selected by the relay for each block by (Tcj_i, Tpj. 
Define the following encoding error events: 



(128) 



for all 



{vj {t,„ tp, |T,,,-_i, Tp,,_i, 1 ) , s], {Tp,^, |Te,,_i, l),u] (1, T,,,_i)) ^ A^, 

(129) 



1 : 2 



Define the following decoding events: 

n (^c) ^c,j) ^p,ji ^cj— 1) ^p,j—i) 

for some t'' = (tc,i, ic,2, ^p,2, ■■■,tc,b, tp,b) 
where each Aj {wc,Wp,tcj,tpj,tcj-i,tpj_i) is defined by: 

Aj (tfc, tfp, tc,jr') ^P,J) ^C,jr'— 1; tp,j — l) 

X] {Wp \Wc, tcj-i ) , (tpj-i \Wc, tcj-l ) , 

t;" (tc,j, ipj |^c,j-i, ^^c) , Wj {wc, tc,j-i) , 
Hence by the union bound, 



Pr (E) < Pr (Eo) + Pr H Eq') + Pr (J 

y(«)c,«'p)7^(i,i) 

< Pr (Eo) + Pr n Eo^) + ^ Pr 

1) By the covering lemma in ll3TI . Pr (Eq) — )■ as long as 

R,> I{V;S\Xn,U) 



(130) 



(131) 



(132) 



(133) 



(134) 



for sufficiently large n. 
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2) By the conditional joint typicality lemma in 11311 . Pr (^E^^ i) l-^o) ~^ for sufficiently 
large n. 

3) For J2 (^(«,,, we have 

= Pr I IJ Pi {wc, Wp, tcj, tpj, tcj-i, tpj^i) j (135) 

Wc7^1,Wpf^l \ t*" j=l / 

U!py tcj y tp j , tcj — ly tpj^l^ j (136) 

tUc^l.-iUp^l \j = l / 

b 

(U^C, W^p, ^C,jr; tpJ, tcj—ly ^p,J — l)) (138) 

lOc^l.'UJpT^l t*" i=2 

^ 2"*(^'=+^p)2"^(^'=+^p)2~"^^~^^(^(^'^«'^''^'-^^"^) (139) 

where (11361 ) holds by the union bound; (11371) holds due to the independence of 
codebook for each block and the memory less property of the channel; (|138|) follows 
from < Pr (Aj) < 1; and (fT39l) follows from the fact that 

Pr(A (w 711 f ■ f ■ f ■ , f ■ < 2-"(^('^'^«'^''^;'*')-^) 

for arbitrary e > and sufficiently large n when Wc ^ I and u'p 7^ 1, by the standard 
argument on joint typicality [|30l Theorem 15.2.1]. 

Note that / (X, Xr, V, U;Y) = I {X, Xr, V; Y) because U ^ (X, Xr, V)^Y forms 
a Markov chain. Thus ^ Pr (-E(«,^,top)) — )■ as long as 

Wc^l,Wp^l 

Rc + Rp< ^ (/ (X, Xr, V- Y)-e)- [r, + Rp) (140) 

for arbitrary e > and sufficiently large n. 
Setting 6 — 7- 00, we have 

Rc + Rp<I (X, Xr, V- Y) - [r, + i?p) . (141) 
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4) For J2 (-E'(ii.c,i))' following similar arguments from (|1351 l to (11381 ). we have 



b 

W'cT^l tb j = 2 

^ 2"^c2"*('^"+'^'')2""^''"^)(^(^'^«'^'^^"^) (143) 
where (11431 ) follows from the fact that 

for arbitrary e > and sufficiently large n when i^c 7^ 1, by the standard argument on 
joint typicality [|30l Theorem 15.2.1]. 



Thus Y. ^ as long as 

Rc < ^ (/ Xr, V- F) - e) - + Rp) (144) 

for arbitrary e > and sufficiently large n. 
Setting 6—7-00, we have 

R,<I (X, Xr, V- Y) - {r, + . (145) 

But notice that (I141|) implies this bound, hence it is redundant. 
5) For Pr (£'(1 .,.^)) . again following similar arguments from (11351 ) to (I1381 i. we have 

- E E E n p^ (^j' ^^'i' ^P'^' ^^j-i' ^^^^^ 

By the standard argument on joint typicality [|30l Theorem 15.2.1] for enumerations 
over all (tcj, tpj, tcj-i, ^p,j-i) given any fixed [wc = l,Wp^ 1), we have 

Pr {Aj (1, Wp, tc,j, tpj, tc,j-i, tpj^i)) (147) 

2-n{i{X;Y\Xn,v,u)-e)^ if tc,j-i = l,tp,j-i = 1 and for any tc,j,tpj 
< < 2-"(^(^'^«'^'^l^)-^), if = 1, tpj_i ^ 1 and for any t^j,tpj (148) 
2-n(iix,Xny;Y)-e)^ if tcj-i 7^ 1 and for any tpj.i,tc,j,tpj 
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= Qj (1, Wp, tc,j-i, tpj^i) (149) 

where the upper bound is dependent on (tc,j-i, ^pj-i) only, for arbitrary e > and 
sufficiently large n. Then we have 

Qj (1, Wp, tcj-i, tpj-i) (150) 

^ 2-n{i{X;Y\Xiiy,u)-t) _|_ 2"^^ 2~'^(-'^("^'^«'^'"*^l'^ + 2'^i^''~^^p)2^^''^'^^'^'^'^'^^~^^ 

(151) 

< 3 X 2-"(™"(ii'i2,i3)-e) (152) 
with 

h = nx-Y\XR,V,U), (153) 
l2 = /(X,X^,V^;y|f/)-^p, (154) 
Is = / (X, Xr, V- Y) - (i?, + Rp^ , (155) 

for arbitrary e > and sufficiently large n. 
Hence, 

6 

- zii] 1] n ^'^•J-i' ^pj-i) ^^^^^ 

= n (1. tpj-i) (157) 

Wpf^l tj, j=2tj_i 

^ 2nbRp2n(^Rc+Rp) ^{b~l)2~n{b-l)(min(li,l2,l3)-'^) (158) 

where (11571) holds because Qj (1, Wp, tcj-i, ^pj-i) is dependent on (tcj-i, tp,j-i) only 
when 7^ 1. 

Thus {E{i,wp)) as long as 



— ^ (mm (Ii, I2, 13) -e)--i^Re + Rp) - 



for arbitrary e > and sufficiently large n. 
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Setting 6 — )■ oo and n — oo, we have 




1 



nX;Y\Xn,V,U), 



(160) 



Rp < I3 



Rp < I2 



nX,Xn,V;Y\U)-Rp, 
I{X,Xii,V;Y)- (^R^ + RpY 



(161) 



(162) 



Again notice that (11411) implies (11621) . hence (11621 ) is redundant. 
Collecting all the necessary constraints (11261 ). (11271 ). (I134D . (11411) . (1 1601) and (I161D . com- 
bining with R = Rc + Rp, Ry = Rc + Rp, and applying Fourier-Motzkin elimination [[311 
Appendix D], we finally establish the achievable rate given by (fT4l)— (fTSi). 
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